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Abstract. These proceedings describe Sleuth, a quasi-model-independent search strategy target- 
ing new electroweak scale physics, and its application to 927 pb~^ of CDF II data. Exclusive final 
states are analyzed for an excess of data beyond the Standard Model prediction at large summed 
scalar transverse momentum (J^Pt)- This analysis of high-^pr data represents one of the most 
encompassing searches so far conducted for new physics at the energy frontier. 

PACS. 12.60.-i Models beyond the standard model 



1 Introduction 

Searching a large subset of collision events recorded 
at the energy frontier for evidence of new physics re- 
quires understanding the Standard Model prediction 
and detector response. To facilitate understanding, de- 
bris from collision events are reduced to 4- vectors of re- 
constructed physics objects: electrons (e), muons (fi), 
taus (r), photons (7), jets (j), and jets tagged as con- 
taining a bottom quark (6). Existing event generators 
are used to construct the Standard Model prediction, 
and the response of the CDF detector to these events 
is estimated using a GEANT-based simulation of the de- 
tector. A correction model is developed to correct pos- 
sible inadequacies in the Standard Model prediction 
and modeling of the detector response. These proce- 
dures form the basis of ViSTA, described in a com- 
panion proceedings in this conference [1]. Having ob- 
tained a global Standard Model background estimate. 
Sleuth is used to target new physics likely to appear 
as an excess of data beyond Standard Model prediction 
at large summed scalar transverse momentum. Simi- 
lar search strategies have been developed previously at 
D0 [2113114] and HI 51. 

2 The Method 

Sleuth is slightly less model-independent search strat- 
egy than ViSTA, as it invokes three additional assump- 
tions. The three additional assumptions motivating 
Sleuth are that new physics will likely appear 

1. as an excess of data over the Standard Model ex- 
pectation, 

2. at large J^Pt with respect to the Standard Model 
background, and 
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3. predominantly in one Sleuth final state. 
Sleuth is expected to be sensitive to any particular 
incarnation of new physics to the extent that the above 
criteria are satisfied. 

Following these assumptions. Sleuth searches for 
statistically significant excesses of data in the high- 
^ Pt tails of all high-pT final states with non-zero 
Standard Model expectation, or where data are ob- 
served. Final states are defined so as to concentrate 
more of the signal of possible new physics. Final states 
equivalent under a global switch of the two lesser gen- 
erations (such as e~^e~ and are merged (into 
£~^£~). Final states equivalent under global charge con- 
jugation (such as £~^£'^ and £~£~) are also merged. 
Since new physics is expected to produce light quarks 
(j) and bottom quarks (b) in pairs, each event is said 
to contain n jet pairs if the number of reconstructed 
jets is 2?! or 2n + 1, and is said to contain m b-jet pairs 
if the number of reconstructed 6-jets is 2m, or 2m — 1 
and there is another jet in the event, for integer m and 
n. 

Sleuth examines the variable 

J2pT^\ \PT^\\ + \PTuncl\ + \M , (1) 

where the first term is the scalar sum of the trans- 
verse momentum of all reconstructed objects, the sec- 
ond term is the transverse momentum of energy visible 
in the detector but not clustered into reconstructed ob- 
jects, and the third term is the transverse component 
of any missing energy in the event. By definition, these 
quantities satisfy 

+i>T+ PTuncl = 0, (2) 

i 

a two-component vector equation in the spatial coor- 
dinates transverse to the axis of the colliding beams. 
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In addition to the fact that many new physics sce- 
narios involve new massive resonances decaying into 
Standard Model objects resulting in events with large 
J2pt, the variable J2pt is useful in practice since it 
is easily defined for any final state, it is fairly insensi- 
tive to soft physics, including the details of final state 
parton showering, and its expected distribution can 
be easily populated on the high tails with additional 
Monte Carlo events by running event generators with 
a (slightly lower) threshold on generated J2pt- 

In each final state. Sleuth considers regions stretch- 
ing from each data point up to infinity in the distribu- 
tion oi^pT- For each region, the Poisson probability 
(p- value) that the Standard Model background in that 
region would fluctuate up to or above the number of 
data events observed in that region is calculated. The 
most interesting region in each final state is the one 
with the smallest p-value. Sleuth next produces en- 
sembles of pseudo data events drawn from the 
distribution of the Standard Model background, and 
for each set of pseudo data events finds the most in- 
teresting region and corresponding p-value. The frac- 
tion (V) of this ensemble of pseudo experiments in 
which the most interesting region corresponds to a p- 
value smaller than that of the most interesting region 
observed in the actual data is determined. The value 
V, a number ranging between zero (very interesting) 
and unity (not interesting), quantifies the probability 
that a random fluctuation of the Standard Model back- 
ground would create any region more interesting than 
the most interesting region observed in the data in that 
particular final state. In each final state considered in- 
dividually, V is the relevant measure of the statistical 
significance of the most interesting region on the tail of 
the J2pt distribution. This process is repeated for all 
Sleuth final states populated by three or more data 
events. 

The most interesting final state is identified as the 
final state with smallest V, denoted by Vmin- An en- 
semble of pseudo experiments involving all final states 
is then used to find the fraction {V) of these pseudo ex- 
periments that would produce one or more final states 
more interesting than the most interesting final state 
in the data. The number V, which ranges between zero 
(very interesting) and unity (not interesting), quanti- 
fies the interest of the final state with the most inter- 
esting region, accounting for the trials factor associ- 
ated with having examined many final states. 

Sleuth's discovery threshold is P < O.OOI. The 
trials factor is such that V at this threshold typically 
corresponds to a region with p-value of about I0~^, 
motivating the default criterion of 5ct typically em- 
ployed in our field. 



3 Sensitivity 

Sensitivity studies are performed to estimate Sleuth's 
sensitivity to possible new physics in 927 pb~^ of CDF 
data. Monte Carlo events corresponding to particular 
models of new physics, including specific parameter 
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Fig. 1. The Sleuth Wbbjj final state is dominated by 
Standard Model tt production. The total Standard Model 
prediction including tt production (dashed line) is in agree- 
ment with observed CDF data (black points). If the top 
quark were unknown, the Standard Model prediction sans 
tt production (red shaded histogram) significantly under- 
estimates the observed data. A global fit of the ViSTA cor- 
rection factors [T] has been unable to coerce the Standard 
Model prediction without tt to match the data in this final 
state. The (blue) arrow indicates the most interesting re- 
gion, identified by Sleuth. The interst of this region, con- 
sidering this final state alone, is found to be P < 8.3 x f 
(upper right), corresponding to a value of V easily satisfy- 
ing Sleuth's discovery threshold of O.OOf . 
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Fig. 2. Sleuth's P as a function of assumed integrated 
luminosity, where top quark pair production is removed 
from the Standard Model background estimate. The hori- 
zontal axis shows integrated luminosity, in units of pb~^. 
The shaded (yellow) band shows the range of values of 
V obtained when the integrated luminosity of the CDF 
data sample is reduced, with the width of the band result- 
ing from individual CDF top quark candidate events being 
randomly retained or discarded. The median V obtained 
is shown as the solid (green) line. The Sleuth discovery 
threshold V < 0.001 is shown as the horizontal dashed 
(gray) fine. 
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points within the MSSM and several flavors of Z', are 
partitioned into Sleuth final states according to the 
objects reconstructed in these events, and are added 
to pseudo data drawn from the Standard Model pre- 
diction. Sleuth is run "blind" on the resulting mix of 
pseudo data and trace amounts of pseudo signal, not 
knowing anything about the pseudo signal that has 
been added. The amount of pseudo signal is gradually 
increased until Sleuth identifies a region of interest 
corresponding to 'P < 0.001. 

The models tested require a cross section of roughly 
a few picobarns to reach Sleuth's discovery threshold 
in 927 pb~^ of data. Sleuth's sensitivity is found to 
be comparable to that of targeted searches for models 
that sufficiently satisfy the three assumptions on which 
Sleuth is based. Sleuth's sensitivity to unspecified 
new physics is in general significantly greater than any 
specific targeted search, due to Sleuth's noticeably 
broader scope. 

Specific Standard Model processes are also used to 
examine Sleuth's sensitivity. If the top quark were 
unknown, its production at the Tevatron would have 
been caught using Sleuth. Observation of ti produc- 
tion in 927 pb"'^ is shown in Fig. [1] An integrated 
luminosity of roughly 80 pb^^ would be required to 
reach the Sleuth discovery thresholcQ, as shown in 
Fig. [21 Standard Model WW production is also easily 
observed. Sleuth has more difficulty with single top 
quark and Higgs boson production, which only par- 
tially satisfy Sleuth's assumption that new physics 
will appear at large ^ pt relative to (other) Standard 
Model processes. 
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Fig. 3. The most interesting Sleuth final state {bb). Filled 
circles (black) show CDF Run II data; the (red) histogram 
shows the standard model prediction, with jj standing for 
two non-& generated partons, and bj for at least one gener- 
ated fe-quark. The horizontal axis shows the summed scalar 
transverse momentum of all objects in each event in the his- 
togram, with the number of events per 20 GeV bin shown 
on the vertical axis. The region Sleuth determines to be 
most interesting is shown with the (blue) arrow to be the 
region corresponding to > 469 GeV. A magnified 

view of this region in logarithmic scale is shown in the in- 
set, together with the number of events predicted (SM) and 
the number of events observed (d). Taking into account the 
trials factor associated with looking at many different re- 
gions. Sleuth determines the fraction of hypothetical sim- 
ilar experiments that would produce something in this final 
state as interesting as the region shown to be P = 0.0055 
(number at upper right). 



4 Results 

In 927 pb^^ of CDF Run II data, the most interest- 
ing region is observed in the bb final state, shown in 
Fig- El The fraction of pseudo experiments in this final 
state alone that would produce a region more interest- 
ing than the one observed \sV = 0.0055. Taking into 
account the many final states considered, the fraction 
of hypothetical similar CDF experiments that would 
observe a final state with V < 0.0055 is 7^ = 0.46. 

Unfortunately, with V = 0.46 > 0.001, Sleuth 
has not indentified any final state containing a region 
at large '^pt with a significant excess of data above 
Standard Model prediction. 



5 Conclusion 

These proceedings have described Sleuth, a quasi- 
model-independent search algorithm designed to iden- 
tify an excess of data above Standard Model predic- 
tion at large summed scalar transverse momentum in 

^ When comparing to the 67 pb~^ needed for discovery 
in CDF Run I W , differences such as the improved CDF II 
detector should be kept in mind. 



any final state. A specific definition of regions con- 
sidered allows Sleuth to rigorously calculate the tri- 
als factor associated with looking in many different 
places. Sensitivity tests performed indicate Sleuth is 
broadly comparable to targeted searches for specific 
models adequately satisfying the three assumptions on 
which it is based. Sleuth is expected to be signifi- 
cantly more sensitive to unspecified new physics than 
any particular targeted search, due to Sleuth's no- 
ticeably broader scope. Application of Sleuth in as- 
sociation with Vista [F to the first 927 pb^^ of CDF 
II data has unfortunately revealed no indication of new 
physics. This analysis represents one of the most en- 
compassing searches so far conducted for new physics 
at the energy frontier. 

This result does not prove that no new physics is 
hiding in Tevatron data; merely that this particular 
global analysis strategy has not yet revealed a discrep- 
ancy on which a discovery claim can be based in the 
927 pb^^ so far analyzed. A global analysis of more 
recently collected CDF data is in progress. 
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